Expansion of the First Hindi-Nepali Word-Net Based Bi-Lingual Dictionary and the advancement of the Human-Machine Interface

Authors

  • Shantanu Kar
  • Alok Chakrabarty
Abstract

Natural Language Processing is opening a new era in Computer Science and machine translation. Human-machine interaction will play a very important role in the coming centuries as human dependence on machines keeps growing. WordNet was first introduced by Miller and Fellbaum in 1985. WordNet is a lexical database for human languages. It groups the words of a language into sets of synonyms called "synsets", provides short, general definitions, and records the various semantic relations between these synonym sets. The WordNet-based Bi-Lingual Dictionary (started in January 2009) is part of the Nepali WordNet Project under the IndoWordNet Project. It is an approach towards the construction and application of a multilingual IndoWordNet, and a pioneering effort in Nepali-to-Hindi and Hindi-to-Nepali translation. Building the Hindi-Nepali WordNet was a challenge because the machine has to translate the sense of each word to produce the expected result. One of the major reasons for this difficulty is the non-availability of rich lexical resources in Hindi and Nepali. The Hindi-Nepali WordNet-based dictionary is thus an endeavor to prepare a rich lexical resource for the Hindi and Nepali languages, both for effective machine translation and to facilitate the development of Information and Communication Technologies in Hindi and Nepali. The endeavor is inspired by the well-known English WordNet, the Hindi WordNet and the Nepali WordNet (the Nepali WordNet was also created by us as research and project work at Assam University, Silchar, as part of the IndoWordNet Project). In this paper we discuss the expansion approach of the first Hindi-Nepali WordNet-based Bi-Lingual Dictionary, the linguistic challenges involved, the WordNet creation tool interface and the synsets' storage structure.
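The idea of a synset-linked bilingual dictionary can be illustrated with a minimal Python sketch. It is not the authors' storage structure, which this abstract does not describe. It assumes that each sense has a shared synset ID under which the Hindi and Nepali synonym sets are stored, so that translation goes through the sense rather than the surface word. The names `Synset`, `BilingualDictionary`, and `hindi_to_nepali`, the ID values, and the English glosses are all hypothetical and chosen only for illustration.

```python
from dataclasses import dataclass, field
from typing import Dict, List, Tuple


@dataclass
class Synset:
    """One sense, shared by both languages (hypothetical layout)."""
    synset_id: int                      # assumed shared ID linking the two languages
    gloss: str                          # short definition of the sense
    hindi: List[str] = field(default_factory=list)
    nepali: List[str] = field(default_factory=list)


class BilingualDictionary:
    """Looks up translations by sense: word -> synset IDs -> words in the other language."""

    def __init__(self, synsets: List[Synset]) -> None:
        self._by_id: Dict[int, Synset] = {s.synset_id: s for s in synsets}
        self._hindi_index: Dict[str, List[int]] = {}
        self._nepali_index: Dict[str, List[int]] = {}
        for s in synsets:
            for word in s.hindi:
                self._hindi_index.setdefault(word, []).append(s.synset_id)
            for word in s.nepali:
                self._nepali_index.setdefault(word, []).append(s.synset_id)

    def hindi_to_nepali(self, word: str) -> List[Tuple[str, List[str]]]:
        """Return one (gloss, Nepali synonyms) pair per sense of the Hindi word."""
        ids = self._hindi_index.get(word, [])
        return [(self._by_id[i].gloss, self._by_id[i].nepali) for i in ids]

    def nepali_to_hindi(self, word: str) -> List[Tuple[str, List[str]]]:
        """Return one (gloss, Hindi synonyms) pair per sense of the Nepali word."""
        ids = self._nepali_index.get(word, [])
        return [(self._by_id[i].gloss, self._by_id[i].hindi) for i in ids]


# Illustrative entries only; IDs and glosses are made up for this sketch.
dictionary = BilingualDictionary([
    Synset(1, "water", hindi=["पानी", "जल", "नीर"], nepali=["पानी", "जल"]),
    Synset(2, "boy", hindi=["बालक"], nepali=["केटो", "बालक"]),
])

print(dictionary.hindi_to_nepali("नीर"))
print(dictionary.nepali_to_hindi("केटो"))
```

Because the lookup is keyed on the sense, a word with several senses would return several (gloss, synonyms) pairs, which is where the sense-translation difficulty mentioned in the abstract shows up.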

Similar resources

Improvement of the Results of Statistical Machine Translation System using Anusaaraka

This paper describes an efficient experimental approach for the improvement of translation quality of phrase based statistical machine translation system by utilizing the insights of the rule based machine translation. As the most primitive step it is believed that appending large and accurately designed linguistic resources such as multiword bilingual dictionaries to the existing training corp...

Some Challenges of Automated Annotation in A Multilingual Scenario

A key ingredient of today's NLP scenario is annotation, and this paper discusses the challenges involved in one of the toughest annotation tasks: sense marking. A large amount of data needs to be sense marked accurately by human annotators in order to train the machine to understand the spoken languages. The sense-marked corpora for various languages facilitate the task of Word Sense Disambig...

Hindi and Marathi to English Cross Language Information Retrieval at CLEF 2007

In this paper, we present our Hindi ->English and Marathi ->English CLIR systems developed as part of our participation in the CLEF 2007 Ad-Hoc Bilingual task. We take a query translation based approach using bi-lingual dictionaries. Query words not found in the dictionary are transliterated using a simple lookup table based transliteration approach. The resultant transliteration is then compar...

Bi-level Model for Reliability based Maintenance and Job Scheduling

Many defects in manufacturing systems are caused by human resources, which shows the significance of human resources in these systems. Most manufacturers attempt to investigate the human resources in order to improve the work conditions and reduce human error by providing a proper work-rest schedule. On the other hand, manufacturers deal with machine scheduling based on demand and wo...

Publication date: 2012